Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 4372 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 478.3 KiB |
| Average record size in memory | 112.0 B |
Variable types
| NUM | 14 |
|---|---|
Warnings
| 201101 is highly correlated with 201012 and 11 other fields | High correlation |
| 201012 is highly correlated with 201101 and 11 other fields | High correlation |
| 201102 is highly correlated with 201012 and 11 other fields | High correlation |
| 201103 is highly correlated with 201012 and 11 other fields | High correlation |
| 201104 is highly correlated with 201012 and 11 other fields | High correlation |
| 201105 is highly correlated with 201012 and 11 other fields | High correlation |
| 201106 is highly correlated with 201012 and 11 other fields | High correlation |
| 201107 is highly correlated with 201012 and 11 other fields | High correlation |
| 201108 is highly correlated with 201012 and 11 other fields | High correlation |
| 201109 is highly correlated with 201012 and 11 other fields | High correlation |
| 201110 is highly correlated with 201012 and 11 other fields | High correlation |
| 201111 is highly correlated with 201012 and 11 other fields | High correlation |
| 201112 is highly correlated with 201012 and 11 other fields | High correlation |
| 201012 is highly skewed (γ1 = 60.33692839) | Skewed |
| 201101 is highly skewed (γ1 = 61.60481322) | Skewed |
| 201102 is highly skewed (γ1 = 60.07710631) | Skewed |
| 201103 is highly skewed (γ1 = 63.40516334) | Skewed |
| 201104 is highly skewed (γ1 = 63.94016045) | Skewed |
| 201105 is highly skewed (γ1 = 61.23983916) | Skewed |
| 201106 is highly skewed (γ1 = 61.96502909) | Skewed |
| 201107 is highly skewed (γ1 = 62.97100934) | Skewed |
| 201108 is highly skewed (γ1 = 57.26313437) | Skewed |
| 201109 is highly skewed (γ1 = 57.86935705) | Skewed |
| 201110 is highly skewed (γ1 = 62.33770402) | Skewed |
| 201111 is highly skewed (γ1 = 59.62841957) | Skewed |
| 201112 is highly skewed (γ1 = 52.26357689) | Skewed |
| CustomerID has unique values | Unique |
| 201012 has 3423 (78.3%) zeros | Zeros |
| 201101 has 3588 (82.1%) zeros | Zeros |
| 201102 has 3573 (81.7%) zeros | Zeros |
| 201103 has 3351 (76.6%) zeros | Zeros |
| 201104 has 3472 (79.4%) zeros | Zeros |
| 201105 has 3292 (75.3%) zeros | Zeros |
| 201106 has 3320 (75.9%) zeros | Zeros |
| 201107 has 3378 (77.3%) zeros | Zeros |
| 201108 has 3391 (77.6%) zeros | Zeros |
| 201109 has 3070 (70.2%) zeros | Zeros |
| 201110 has 2947 (67.4%) zeros | Zeros |
| 201111 has 2661 (60.9%) zeros | Zeros |
| 201112 has 3685 (84.3%) zeros | Zeros |
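
The zero percentages in these warnings appear to be the zero count divided by the 4372 observations, rounded to one decimal place. A minimal sketch of that arithmetic, using three of the counts listed above (the variable names are illustrative, not part of the report):

```python
# Number of observations, as stated in the dataset statistics.
n_observations = 4372

# Zero counts copied from the warnings for three of the monthly columns.
zero_counts = {"201012": 3423, "201111": 2661, "201112": 3685}

# Share of zeros per column, as a percentage rounded to one decimal place.
zero_share = {
    month: round(100 * count / n_observations, 1)
    for month, count in zero_counts.items()
}

for month, share in zero_share.items():
    print(f"{month}: {share}% zeros")
```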
Reproduction
| Analysis started | 2022-11-02 07:45:28.804262 |
|---|---|
| Analysis finished | 2022-11-02 07:45:50.074827 |
| Duration | 21.27 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
CustomerID
| Distinct | 4372 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15299.67772 |
|---|---|
| Minimum | 12346 |
| Maximum | 18287 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 12346 |
|---|---|
| 5-th percentile | 12613.55 |
| Q1 | 13812.75 |
| median | 15300.5 |
| Q3 | 16778.25 |
| 95-th percentile | 17984.45 |
| Maximum | 18287 |
| Range | 5941 |
| Interquartile range (IQR) | 2965.5 |
Descriptive statistics
| Standard deviation | 1722.390705 |
|---|---|
| Coefficient of variation (CV) | 0.1125769272 |
| Kurtosis | -1.195793327 |
| Mean | 15299.67772 |
| Median Absolute Deviation (MAD) | 1483.5 |
| Skewness | 0.0009180495309 |
| Sum | 66890191 |
| Variance | 2966629.742 |
| Monotonicity | Strictly increasing |
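
Several of the figures above are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the range and IQR are differences of quantiles. The sketch below checks these identities against the values reported for this variable; the variable names are illustrative, and small differences come from the rounding in the report.

```python
import math

# Figures copied from the statistics reported above.
n = 4372                      # number of observations
total = 66890191              # Sum
std = 1722.390705             # Standard deviation
minimum, maximum = 12346, 18287
q1, q3 = 13812.75, 16778.25

mean = total / n              # reported as 15299.67772
variance = std ** 2           # reported as 2966629.742
cv = std / mean               # reported as 0.1125769272
value_range = maximum - minimum   # reported as 5941
iqr = q3 - q1                 # reported as 2965.5

print(f"mean={mean:.5f} variance={variance:.3f} cv={cv:.10f}")
print(f"range={value_range} iqr={iqr}")
```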
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 12346 | 1 | < 0.1% | |
| 16282 | 1 | < 0.1% | |
| 16295 | 1 | < 0.1% | |
| 16293 | 1 | < 0.1% | |
| 16292 | 1 | < 0.1% | |
| 16287 | 1 | < 0.1% | |
| 16284 | 1 | < 0.1% | |
| 16283 | 1 | < 0.1% | |
| 16281 | 1 | < 0.1% | |
| 16222 | 1 | < 0.1% | |
| Other values (4362) | 4362 | 99.8% |
| Value | Count | Frequency (%) | |
| 12346 | 1 | < 0.1% | |
| 12347 | 1 | < 0.1% | |
| 12348 | 1 | < 0.1% | |
| 12349 | 1 | < 0.1% | |
| 12350 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 18287 | 1 | < 0.1% | |
| 18283 | 1 | < 0.1% | |
| 18282 | 1 | < 0.1% | |
| 18281 | 1 | < 0.1% | |
| 18280 | 1 | < 0.1% |
201012
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4631747484 |
|---|---|
| Minimum | 0 |
| Maximum | 317 |
| Zeros | 3423 |
| Zeros (%) | 78.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 317 |
| Range | 317 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.94215154 |
|---|---|
| Coefficient of variation (CV) | 10.67016619 |
| Kurtosis | 3853.444815 |
| Mean | 0.4631747484 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 60.33692839 |
| Sum | 2025 |
| Variance | 24.42486185 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=17)
| Value | Count | Frequency (%) | |
| 0 | 3423 | 78.3% | |
| 1 | 591 | 13.5% | |
| 2 | 194 | 4.4% | |
| 3 | 88 | 2.0% | |
| 4 | 37 | 0.8% | |
| 5 | 15 | 0.3% | |
| 6 | 9 | 0.2% | |
| 7 | 4 | 0.1% | |
| 11 | 2 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| Other values (7) | 7 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 3423 | 78.3% | |
| 1 | 591 | 13.5% | |
| 2 | 194 | 4.4% | |
| 3 | 88 | 2.0% | |
| 4 | 37 | 0.8% |
| Value | Count | Frequency (%) | |
| 317 | 1 | < 0.1% | |
| 37 | 1 | < 0.1% | |
| 34 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 13 | 1 | < 0.1% |
201101
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3376029277 |
|---|---|
| Minimum | 0 |
| Maximum | 240 |
| Zeros | 3588 |
| Zeros (%) | 82.1% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 240 |
| Range | 240 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.71304557 |
|---|---|
| Coefficient of variation (CV) | 10.99826235 |
| Kurtosis | 3973.552043 |
| Mean | 0.3376029277 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 61.60481322 |
| Sum | 1476 |
| Variance | 13.78670741 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) | |
| 0 | 3588 | 82.1% | |
| 1 | 530 | 12.1% | |
| 2 | 160 | 3.7% | |
| 3 | 53 | 1.2% | |
| 4 | 19 | 0.4% | |
| 5 | 6 | 0.1% | |
| 6 | 5 | 0.1% | |
| 7 | 5 | 0.1% | |
| 12 | 3 | 0.1% | |
| 11 | 1 | < 0.1% | |
| Other values (2) | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3588 | 82.1% | |
| 1 | 530 | 12.1% | |
| 2 | 160 | 3.7% | |
| 3 | 53 | 1.2% | |
| 4 | 19 | 0.4% |
| Value | Count | Frequency (%) | |
| 240 | 1 | < 0.1% | |
| 12 | 3 | 0.1% | |
| 11 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 7 | 5 | 0.1% |
201102
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3186184812 |
|---|---|
| Minimum | 0 |
| Maximum | 191 |
| Zeros | 3573 |
| Zeros (%) | 81.7% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 191 |
| Range | 191 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.979498485 |
|---|---|
| Coefficient of variation (CV) | 9.35130465 |
| Kurtosis | 3840.276917 |
| Mean | 0.3186184812 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 60.07710631 |
| Sum | 1393 |
| Variance | 8.877411223 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) | |
| 0 | 3573 | 81.7% | |
| 1 | 555 | 12.7% | |
| 2 | 166 | 3.8% | |
| 3 | 40 | 0.9% | |
| 4 | 15 | 0.3% | |
| 5 | 14 | 0.3% | |
| 6 | 4 | 0.1% | |
| 8 | 2 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| 191 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3573 | 81.7% | |
| 1 | 555 | 12.7% | |
| 2 | 166 | 3.8% | |
| 3 | 40 | 0.9% | |
| 4 | 15 | 0.3% |
| Value | Count | Frequency (%) | |
| 191 | 1 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 6 | 4 | 0.1% |
201103
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.453568161 |
|---|---|
| Minimum | 0 |
| Maximum | 364 |
| Zeros | 3351 |
| Zeros (%) | 76.6% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 364 |
| Range | 364 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.577629074 |
|---|---|
| Coefficient of variation (CV) | 12.29722356 |
| Kurtosis | 4131.864275 |
| Mean | 0.453568161 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 63.40516334 |
| Sum | 1983 |
| Variance | 31.10994608 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=15)
| Value | Count | Frequency (%) | |
| 0 | 3351 | 76.6% | |
| 1 | 699 | 16.0% | |
| 2 | 200 | 4.6% | |
| 3 | 64 | 1.5% | |
| 4 | 27 | 0.6% | |
| 5 | 11 | 0.3% | |
| 7 | 5 | 0.1% | |
| 6 | 4 | 0.1% | |
| 8 | 4 | 0.1% | |
| 10 | 2 | < 0.1% | |
| Other values (5) | 5 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3351 | 76.6% | |
| 1 | 699 | 16.0% | |
| 2 | 200 | 4.6% | |
| 3 | 64 | 1.5% | |
| 4 | 27 | 0.6% |
| Value | Count | Frequency (%) | |
| 364 | 1 | < 0.1% | |
| 18 | 1 | < 0.1% | |
| 13 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 11 | 1 | < 0.1% |
201104
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3989021043 |
|---|---|
| Minimum | 0 |
| Maximum | 360 |
| Zeros | 3472 |
| Zeros (%) | 79.4% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 360 |
| Range | 360 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.501363462 |
|---|---|
| Coefficient of variation (CV) | 13.79126207 |
| Kurtosis | 4179.344776 |
| Mean | 0.3989021043 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 63.94016045 |
| Sum | 1744 |
| Variance | 30.26499994 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=13)
| Value | Count | Frequency (%) | |
| 0 | 3472 | 79.4% | |
| 1 | 630 | 14.4% | |
| 2 | 171 | 3.9% | |
| 3 | 50 | 1.1% | |
| 4 | 25 | 0.6% | |
| 5 | 6 | 0.1% | |
| 7 | 5 | 0.1% | |
| 6 | 4 | 0.1% | |
| 8 | 4 | 0.1% | |
| 10 | 2 | < 0.1% | |
| Other values (3) | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3472 | 79.4% | |
| 1 | 630 | 14.4% | |
| 2 | 171 | 3.9% | |
| 3 | 50 | 1.1% | |
| 4 | 25 | 0.6% |
| Value | Count | Frequency (%) | |
| 360 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 10 | 2 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 8 | 4 | 0.1% |
201105
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4945105215 |
|---|---|
| Minimum | 0 |
| Maximum | 313 |
| Zeros | 3292 |
| Zeros (%) | 75.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 313 |
| Range | 313 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.851934125 |
|---|---|
| Coefficient of variation (CV) | 9.811589267 |
| Kurtosis | 3939.913076 |
| Mean | 0.4945105215 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 61.23983916 |
| Sum | 2162 |
| Variance | 23.54126476 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=16)
| Value | Count | Frequency (%) | |
| 0 | 3292 | 75.3% | |
| 1 | 683 | 15.6% | |
| 2 | 247 | 5.6% | |
| 3 | 75 | 1.7% | |
| 4 | 32 | 0.7% | |
| 5 | 15 | 0.3% | |
| 7 | 8 | 0.2% | |
| 6 | 7 | 0.2% | |
| 9 | 4 | 0.1% | |
| 8 | 3 | 0.1% | |
| Other values (6) | 6 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3292 | 75.3% | |
| 1 | 683 | 15.6% | |
| 2 | 247 | 5.6% | |
| 3 | 75 | 1.7% | |
| 4 | 32 | 0.7% |
| Value | Count | Frequency (%) | |
| 313 | 1 | < 0.1% | |
| 25 | 1 | < 0.1% | |
| 20 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 15 | 1 | < 0.1% |
201106
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4602012809 |
|---|---|
| Minimum | 0 |
| Maximum | 305 |
| Zeros | 3320 |
| Zeros (%) | 75.9% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 305 |
| Range | 305 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.708916056 |
|---|---|
| Coefficient of variation (CV) | 10.23229672 |
| Kurtosis | 4004.945288 |
| Mean | 0.4602012809 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 61.96502909 |
| Sum | 2012 |
| Variance | 22.17389042 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=15)
| Value | Count | Frequency (%) | |
| 0 | 3320 | 75.9% | |
| 1 | 719 | 16.4% | |
| 2 | 188 | 4.3% | |
| 3 | 73 | 1.7% | |
| 4 | 36 | 0.8% | |
| 5 | 12 | 0.3% | |
| 6 | 8 | 0.2% | |
| 7 | 6 | 0.1% | |
| 8 | 4 | 0.1% | |
| 16 | 1 | < 0.1% | |
| Other values (5) | 5 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3320 | 75.9% | |
| 1 | 719 | 16.4% | |
| 2 | 188 | 4.3% | |
| 3 | 73 | 1.7% | |
| 4 | 36 | 0.8% |
| Value | Count | Frequency (%) | |
| 305 | 1 | < 0.1% | |
| 20 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% |
201107
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4407593779 |
|---|---|
| Minimum | 0 |
| Maximum | 334 |
| Zeros | 3378 |
| Zeros (%) | 77.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 334 |
| Range | 334 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.129520269 |
|---|---|
| Coefficient of variation (CV) | 11.63791521 |
| Kurtosis | 4093.453966 |
| Mean | 0.4407593779 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 62.97100934 |
| Sum | 1927 |
| Variance | 26.31197819 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=13)
| Value | Count | Frequency (%) | |
| 0 | 3378 | 77.3% | |
| 1 | 666 | 15.2% | |
| 2 | 194 | 4.4% | |
| 3 | 77 | 1.8% | |
| 4 | 33 | 0.8% | |
| 5 | 8 | 0.2% | |
| 8 | 5 | 0.1% | |
| 6 | 5 | 0.1% | |
| 17 | 2 | < 0.1% | |
| 13 | 1 | < 0.1% | |
| Other values (3) | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3378 | 77.3% | |
| 1 | 666 | 15.2% | |
| 2 | 194 | 4.4% | |
| 3 | 77 | 1.8% | |
| 4 | 33 | 0.8% |
| Value | Count | Frequency (%) | |
| 334 | 1 | < 0.1% | |
| 17 | 2 | < 0.1% | |
| 13 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 8 | 5 | 0.1% |
201108
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3973010064 |
|---|---|
| Minimum | 0 |
| Maximum | 193 |
| Zeros | 3391 |
| Zeros (%) | 77.6% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 193 |
| Range | 193 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.059947386 |
|---|---|
| Coefficient of variation (CV) | 7.701836483 |
| Kurtosis | 3593.620868 |
| Mean | 0.3973010064 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 57.26313437 |
| Sum | 1737 |
| Variance | 9.363278003 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=16)
| Value | Count | Frequency (%) | |
| 0 | 3391 | 77.6% | |
| 1 | 682 | 15.6% | |
| 2 | 195 | 4.5% | |
| 3 | 49 | 1.1% | |
| 4 | 23 | 0.5% | |
| 5 | 12 | 0.3% | |
| 6 | 5 | 0.1% | |
| 9 | 3 | 0.1% | |
| 7 | 3 | 0.1% | |
| 10 | 2 | < 0.1% | |
| Other values (6) | 7 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 3391 | 77.6% | |
| 1 | 682 | 15.6% | |
| 2 | 195 | 4.5% | |
| 3 | 49 | 1.1% | |
| 4 | 23 | 0.5% |
| Value | Count | Frequency (%) | |
| 193 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| 13 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% |
201109
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5322506862 |
|---|---|
| Minimum | 0 |
| Maximum | 250 |
| Zeros | 3070 |
| Zeros (%) | 70.2% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 250 |
| Range | 250 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.952082373 |
|---|---|
| Coefficient of variation (CV) | 7.425227388 |
| Kurtosis | 3636.515431 |
| Mean | 0.5322506862 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 57.86935705 |
| Sum | 2327 |
| Variance | 15.61895508 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=18)
| Value | Count | Frequency (%) | |
| 0 | 3070 | 70.2% | |
| 1 | 901 | 20.6% | |
| 2 | 245 | 5.6% | |
| 3 | 90 | 2.1% | |
| 4 | 32 | 0.7% | |
| 5 | 9 | 0.2% | |
| 6 | 8 | 0.2% | |
| 7 | 4 | 0.1% | |
| 10 | 2 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| Other values (8) | 9 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 3070 | 70.2% | |
| 1 | 901 | 20.6% | |
| 2 | 245 | 5.6% | |
| 3 | 90 | 2.1% | |
| 4 | 32 | 0.7% |
| Value | Count | Frequency (%) | |
| 250 | 1 | < 0.1% | |
| 39 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 14 | 1 | < 0.1% |
201110
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6031564501 |
|---|---|
| Minimum | 0 |
| Maximum | 375 |
| Zeros | 2947 |
| Zeros (%) | 67.4% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 375 |
| Range | 375 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 5.777790144 |
|---|---|
| Coefficient of variation (CV) | 9.579256165 |
| Kurtosis | 4036.421673 |
| Mean | 0.6031564501 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 62.33770402 |
| Sum | 2637 |
| Variance | 33.38285894 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=16)
| Value | Count | Frequency (%) | |
| 0 | 2947 | 67.4% | |
| 1 | 979 | 22.4% | |
| 2 | 281 | 6.4% | |
| 3 | 87 | 2.0% | |
| 4 | 40 | 0.9% | |
| 5 | 17 | 0.4% | |
| 11 | 5 | 0.1% | |
| 6 | 4 | 0.1% | |
| 8 | 4 | 0.1% | |
| 9 | 2 | < 0.1% | |
| Other values (6) | 6 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 2947 | 67.4% | |
| 1 | 979 | 22.4% | |
| 2 | 281 | 6.4% | |
| 3 | 87 | 2.0% | |
| 4 | 40 | 0.9% |
| Value | Count | Frequency (%) | |
| 375 | 1 | < 0.1% | |
| 29 | 1 | < 0.1% | |
| 21 | 1 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% |
201111
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7918572736 |
|---|---|
| Minimum | 0 |
| Maximum | 377 |
| Zeros | 2661 |
| Zeros (%) | 60.9% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 377 |
| Range | 377 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 5.897752096 |
|---|---|
| Coefficient of variation (CV) | 7.447998891 |
| Kurtosis | 3791.451736 |
| Mean | 0.7918572736 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 59.62841957 |
| Sum | 3462 |
| Variance | 34.78347978 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=18)
| Value | Count | Frequency (%) | |
| 0 | 2661 | 60.9% | |
| 1 | 1037 | 23.7% | |
| 2 | 382 | 8.7% | |
| 3 | 156 | 3.6% | |
| 4 | 62 | 1.4% | |
| 5 | 29 | 0.7% | |
| 6 | 18 | 0.4% | |
| 7 | 8 | 0.2% | |
| 8 | 6 | 0.1% | |
| 13 | 3 | 0.1% | |
| Other values (8) | 10 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 2661 | 60.9% | |
| 1 | 1037 | 23.7% | |
| 2 | 382 | 8.7% | |
| 3 | 156 | 3.6% | |
| 4 | 62 | 1.4% |
| Value | Count | Frequency (%) | |
| 377 | 1 | < 0.1% | |
| 48 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 13 | 3 | 0.1% |
201112
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2321591949 |
|---|---|
| Minimum | 0 |
| Maximum | 94 |
| Zeros | 3685 |
| Zeros (%) | 84.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 94 |
| Range | 94 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.536809724 |
|---|---|
| Coefficient of variation (CV) | 6.619637548 |
| Kurtosis | 3173.047648 |
| Mean | 0.2321591949 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 52.26357689 |
| Sum | 1015 |
| Variance | 2.361784127 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 3685 | 84.3% | |
| 1 | 536 | 12.3% | |
| 2 | 98 | 2.2% | |
| 3 | 39 | 0.9% | |
| 4 | 5 | 0.1% | |
| 6 | 3 | 0.1% | |
| 5 | 3 | 0.1% | |
| 9 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 94 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3685 | 84.3% | |
| 1 | 536 | 12.3% | |
| 2 | 98 | 2.2% | |
| 3 | 39 | 0.9% | |
| 4 | 5 | 0.1% |
| Value | Count | Frequency (%) | |
| 94 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 6 | 3 | 0.1% | |
| 5 | 3 | 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r; only the sign of the slope does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
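
As a minimal sketch of that calculation (not the code the profiling tool itself runs), the function below divides the covariance by the product of the standard deviations in plain Python. The name `pearson_r` and the sample data are illustrative assumptions.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least two points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations; the 1/n factors cancel in the ratio.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return cov / math.sqrt(var_x * var_y)

x = [0, 1, 2, 3, 4]
y = [1, 3, 5, 7, 9]                                # y = 2x + 1, perfectly linear
r_xy = pearson_r(x, y)
r_scaled = pearson_r([10 * v - 3 for v in x], y)   # shift and rescale x
r_neg = pearson_r(x, [-v for v in y])              # flip the sign of the slope
print(r_xy, r_scaled, r_neg)
```

Shifting and rescaling `x` leaves the result unchanged, while negating `y` flips its sign, which matches the invariance property described above.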
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
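
A minimal sketch of that definition: rank both variables, then apply the Pearson formula to the ranks. Giving tied values the average of the ranks they span is an assumption made here (it is a common convention); the helper names are illustrative.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]          # monotonic but not linear
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this cubic relationship ρ reaches 1 while r stays below 1, which illustrates why ρ is better suited to nonlinear monotonic relationships.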
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
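
The pair-counting formula in this paragraph corresponds to the variant usually called tau-a. A minimal sketch follows, assuming tied pairs count as neither concordant nor discordant; the function name is illustrative. Statistical libraries commonly apply a tie correction (tau-b) instead, so their output may differ on heavily tied data such as the monthly counts in this report.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least two points")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1      # both variables move in the same direction
        elif sign < 0:
            discordant += 1      # the variables move in opposite directions
        # sign == 0 means a tie in x or y; it contributes to neither count
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

tau_same = kendall_tau_a([1, 2, 3, 4], [10, 20, 30, 40])
tau_reversed = kendall_tau_a([1, 2, 3, 4], [40, 30, 20, 10])
tau_mixed = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])   # 5 concordant, 1 discordant
print(tau_same, tau_reversed, tau_mixed)
```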
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | CustomerID | 201012 | 201101 | 201102 | 201103 | 201104 | 201105 | 201106 | 201107 | 201108 | 201109 | 201110 | 201111 | 201112 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 12346 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 | 12347 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
| 2 | 12348 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| 3 | 12349 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 4 | 12350 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 12352 | 0 | 0 | 1 | 7 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 0 |
| 6 | 12353 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 | 12354 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 12355 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 | 12356 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
Last rows
| | CustomerID | 201012 | 201101 | 201102 | 201103 | 201104 | 201105 | 201106 | 201107 | 201108 | 201109 | 201110 | 201111 | 201112 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4362 | 18273 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
| 4363 | 18274 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 |
| 4364 | 18276 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 0 |
| 4365 | 18277 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| 4366 | 18278 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| 4367 | 18280 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4368 | 18281 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4369 | 18282 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 1 |
| 4370 | 18283 | 0 | 2 | 1 | 0 | 1 | 1 | 2 | 2 | 0 | 1 | 1 | 4 | 1 |
| 4371 | 18287 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 2 | 0 | 0 |